DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 27

1
MAGIC DUST FOR CROSS-LINGUAL ADAPTATION OF MONOLINGUAL WAV2VEC-2.0
In: ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03544515 ; ICASSP 2022, May 2022, Singapour, Singapore (2022)
BASE
Show details
2
End-to-end speaker segmentation for overlap-aware resegmentation
In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
BASE
Show details
3
Transdisciplinary Analysis of a Corpus of French Newsreels: The ANTRACT Project
In: ISSN: 1938-4122 ; Digital Humanities Quarterly ; https://hal.archives-ouvertes.fr/hal-03166755 ; Digital Humanities Quarterly, Alliance of Digital Humanities, 2021, Special Issue on AudioVisual Data in DH, 15 (1) ; http://digitalhumanities.org/dhq/ (2021)
Abstract: Editors: Taylor Arnold, Jasmijn van Gorp, Stefania Scagliola, and Lauren Tilton ; International audience ; The ANTRACT project is a cross-disciplinary apparatus dedicated to the analysis of the French newsreel company Les Actualités Françaises (1945-1969) and its film productions. Founded during the liberation of France, this state-owned company filmed more than 20,000 news reports shown in French cinemas and throughout the world over its 24 years of activity. The project brings together research organizations with a dual historical and technological perspective. ANTRACT’s goal is to study the production process, the film content, the way historical events are represented and the audience reception of Les Actualités Françaises newsreels using innovative AI-based data processing tools developed by partners specialized in image, audio, and text analysis.This article focuses on the data processing apparatus and tools of the project. Automatic content analysis is used to select data, to segment video units and typescript images, and to align them with their archival description. Automatic speech recognition provides a textual representation and natural language processing can extract named entities from the voice-over recording; automatic visual analysis is applied to detect and recognize faces of well-known characters in videos. These multifaceted data can then be queried and explored with the TXM text-mining platform.The results of these automatic analysis processes are feeding the Okapi platform, a client-server software that integrates documentation, information retrieval, and hypermedia capabilities within a single environment based on the Semantic Web standards. The complete corpus of Les Actualités Françaises, enriched with data and metadata, will be made available to the scientific community by the end of the project.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]; [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]; [SHS.HIST]Humanities and Social Sciences/History; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SHS.STAT]Humanities and Social Sciences/Methods and statistics; Audiovisual archives; Automatic alignment of video with text; Automatic face recognition; Automatic speech recognition; Automatic video segmentation; Automatic visual analysis; Contemporary history; Cross-disciplinary approach; Digital Humanities; FaceRec software; Knowledge management; Les Actualités françaises; Media history; Multimedia content annotation; Newsreels; Okapi software; Open-source software; Qualitative text analysis; Quantitative text analysis; Semantic web; Text mining; Textometry; TXM software
URL: https://hal.archives-ouvertes.fr/hal-03166755/file/antract_carrive_al_dhq21_200712.pdf
https://hal.archives-ouvertes.fr/hal-03166755/document
https://hal.archives-ouvertes.fr/hal-03166755
BASE
Hide details
4
Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0 ...
BASE
Show details
5
Where are we in Named Entity Recognition from Speech?
In: 12th International Conference on Language Resources and Evaluation (LREC) ; https://hal.archives-ouvertes.fr/hal-02475026 ; 12th International Conference on Language Resources and Evaluation (LREC), May 2020, Marseille, France ; https://aclanthology.org/2020.lrec-1.556/ (2020)
BASE
Show details
6
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02912029 ; Interspeech 2020, Oct 2020, Shanghai, China (2020)
BASE
Show details
7
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning ...
BASE
Show details
8
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning ...
BASE
Show details
9
Collective memory shapes the organization of individual memories in the medial prefrontal cortex
In: EISSN: 2397-3374 ; Nature Human Behaviour ; https://halshs.archives-ouvertes.fr/halshs-02416130 ; Nature Human Behaviour, Nature Research 2019, ⟨10.1038/s41562-019-0779-z⟩ (2019)
BASE
Show details
10
Effective keyword search for low-resourced conversational speech
In: icassp 2017 ; https://hal.archives-ouvertes.fr/hal-01744176 ; icassp 2017, IEEE, Mar 2017, La Nouvelle Orléans, United States (2017)
BASE
Show details
11
An investigation into language model data augmentation for low-resourced STT and KWS
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01837171 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Mar 2017, New Orleans, United States (2017)
BASE
Show details
12
Language Recognition for Dialects and Closely Related Languages
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
BASE
Show details
13
Language Model Data Augmentation for Keyword Spotting
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837186 ; Annual Conference of the International Speech Communication Association , Jan 2016, San Francisco, United States (2016)
BASE
Show details
14
Investigating techniques for low resource conversational speech recognition
In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016) ; https://hal-univ-lemans.archives-ouvertes.fr/hal-01515254 ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Mar 2016, Shangai, China. pp.5975-5979, ⟨10.1109/ICASSP.2016.7472824⟩ ; www.icassp2016.org (2016)
BASE
Show details
15
Improving Data Selection for Low Resource STT and KWS
BASE
Show details
16
Investigating Techniques for Low Resource Conversational Speech Recognition
BASE
Show details
17
Improving recognition of proper nouns in ASR through generating and filtering phonetic transcriptions
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 28 (2014) 4, 979-996
OLC Linguistik
Show details
18
Traduction de la parole dans le projet RAPMAT
In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
BASE
Show details
19
Boosting bonsai trees for efficient features combination : application to speaker role identification
In: Interspeech ; https://hal.inria.fr/hal-01025171 ; Interspeech, Sep 2014, Singapour, Singapore (2014)
BASE
Show details
20
Development of a Korean speech recognition system with little annontated data
In: International Workshop on Spoken Languages Technologies for Under-resourced languages ; https://hal.archives-ouvertes.fr/hal-01843405 ; International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St Petersburg, Russia (2014)
BASE
Show details

Page: 1 2

Catalogues
0
0
1
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
26
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern